Dynamic Programming and Suboptimal Control: A Survey from ADP to MPC

نویسنده

  • Dimitri P. Bertsekas
چکیده

We survey some recent research directions within the field of approximate dynamic programming, with a particular emphasis on rollout algorithms and model predictive control (MPC). We argue that while they are motivated by different concerns, these two methodologies are closely connected, and the mathematical essence of their desirable properties (cost improvement and stability, respectively) is couched on the central dynamic programming idea of policy iteration. In particular, among other things, we show that the most common MPC schemes can be viewed as rollout algorithms and are related to policy iteration methods. Furthermore, we embed rollout and MPC within a new unifying suboptimal control framework, based on a concept of restricted or constrained structure policies, which contains these schemes as special cases.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dynamic Programming and Suboptimal Control: A Survey from ADP to MPC1

We survey some recent research directions within the field of approximate dynamic programming (ADP), with a particular emphasis on rollout algorithms and model predictive control (MPC). We argue that while motivated by different concerns, these two methodologies are closely connected, and the mathematical essence of their desirable properties (cost improvement and stability, respectively) is co...

متن کامل

64c Approximate Dynamic Programming Based Strategy for Markov Decision Problems in Process Control and Scheduling

Most interesting problems in process control and scheduling can be formulated as a Markov Decision Problem (MDP). This includes real-time decision problems (e.g., feedback control and information-based rescheduling) that involve significant amounts of stochastic uncertainty. Optimal policy for MDPs can be derived by solving an associated stochastic dynamic programming (DP) problem. However, the...

متن کامل

Navigation In GPS-Denied Environments Using Approximate Dynamic Programming

Controlling a mobile vehicle to navigate in GPS-denied environments introduces a challenging partially observable control problem with complex constraints. This report presents a combination of various suboptimal control schemes such as open loop feedback control (OLFC), certainty equivalent control (CEC), model predictive control (MPC), and using expected values of estimates as full states to ...

متن کامل

From robust model predictive control to stochastic optimal control and approximate dynamic programming: A perspective gained from a personal journey

Developments in robust model predictive control are reviewed from a perspective gained through a personal involvement in the research area during the past two decades. Various min–max MPC formulations are discussed in the setting of optimizing the “worst-case” performance in closed loop. One of the insights gained is that the conventional open-loop formulation of MPC is fundamentally flawed to ...

متن کامل

Online set-point optimisation cooperating with predictive control of a yeast fermentation process: A neural network approach

Online set-point optimisation which cooperates with model predictive control (MPC) and its application to a yeast fermentation process are described. A computationally efficient multilayer control system structure with adaptive steady-state target optimisation (ASSTO) and a suboptimal MPC algorithm are presented in which two neural models of the process are used. For set-point optimisation, a s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Eur. J. Control

دوره 11  شماره 

صفحات  -

تاریخ انتشار 2005